Overview
Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 1003 |
| Missing cells | 149 |
| Missing cells (%) | 0.9% |
| Duplicate rows | 3 |
| Duplicate rows (%) | 0.3% |
| Total size in memory | 627.8 KiB |
| Average record size in memory | 640.9 B |
Variable types
| Text | 1 |
|---|---|
| Categorical | 7 |
| Numeric | 7 |
| DateTime | 2 |
gross margin percentage has constant value "4.761904762" | Constant |
| Dataset has 3 (0.3%) duplicate rows | Duplicates |
Branch is highly overall correlated with City | High correlation |
City is highly overall correlated with Branch | High correlation |
Quantity is highly overall correlated with Tax 5% and 3 other fields | High correlation |
Tax 5% is highly overall correlated with Quantity and 4 other fields | High correlation |
Total is highly overall correlated with Quantity and 4 other fields | High correlation |
Unit price is highly overall correlated with Tax 5% and 3 other fields | High correlation |
cogs is highly overall correlated with Quantity and 4 other fields | High correlation |
gross income is highly overall correlated with Quantity and 4 other fields | High correlation |
Customer type has 79 (7.9%) missing values | Missing |
Product line has 43 (4.3%) missing values | Missing |
Quantity has 20 (2.0%) missing values | Missing |
Reproduction
| Analysis started | 2026-01-19 11:49:01.906134 |
|---|---|
| Analysis finished | 2026-01-19 11:49:08.021602 |
| Duration | 6.12 seconds |
| Software version | ydata-profiling vv4.18.0 |
| Download configuration | config.json |
Variables
Invoice ID
Text
| Distinct | 1000 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 997 ? |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | 750-67-8428 |
|---|---|
| 2nd row | 226-31-3081 |
| 3rd row | 631-41-3108 |
| 4th row | 123-19-1176 |
| 5th row | 373-73-7910 |
| Value | Count | Frequency (%) |
| 849-09-3807 | 2 | 0.2% |
| 745-74-0715 | 2 | 0.2% |
| 452-04-8808 | 2 | 0.2% |
| 433-75-6987 | 1 | 0.1% |
| 252-56-2699 | 1 | 0.1% |
| 871-79-8483 | 1 | 0.1% |
| 848-62-7243 | 1 | 0.1% |
| 631-41-3108 | 1 | 0.1% |
| 123-19-1176 | 1 | 0.1% |
| 373-73-7910 | 1 | 0.1% |
| Other values (990) | 990 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 2006 | |
| 2 | 958 | |
| 6 | 954 | |
| 1 | 951 | |
| 8 | 949 | |
| 5 | 930 | |
| 4 | 923 | |
| 3 | 910 | |
| 7 | 899 | |
| 0 | 814 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11033 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| - | 2006 | |
| 2 | 958 | |
| 6 | 954 | |
| 1 | 951 | |
| 8 | 949 | |
| 5 | 930 | |
| 4 | 923 | |
| 3 | 910 | |
| 7 | 899 | |
| 0 | 814 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11033 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| - | 2006 | |
| 2 | 958 | |
| 6 | 954 | |
| 1 | 951 | |
| 8 | 949 | |
| 5 | 930 | |
| 4 | 923 | |
| 3 | 910 | |
| 7 | 899 | |
| 0 | 814 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11033 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| - | 2006 | |
| 2 | 958 | |
| 6 | 954 | |
| 1 | 951 | |
| 8 | 949 | |
| 5 | 930 | |
| 4 | 923 | |
| 3 | 910 | |
| 7 | 899 | |
| 0 | 814 |
Branch
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 56.9 KiB |
| A | |
|---|---|
| B | |
| C |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | A |
|---|---|
| 2nd row | C |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 342 | |
| B | 333 | |
| C | 328 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 342 | |
| b | 333 | |
| c | 328 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 342 | |
| B | 333 | |
| C | 328 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1003 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 342 | |
| B | 333 | |
| C | 328 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1003 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 342 | |
| B | 333 | |
| C | 328 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1003 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 342 | |
| B | 333 | |
| C | 328 |
City
Categorical
High correlation
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 63.4 KiB |
| Yangon | |
|---|---|
| Mandalay | |
| Naypyitaw |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.6450648 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yangon |
|---|---|
| 2nd row | Naypyitaw |
| 3rd row | Yangon |
| 4th row | Yangon |
| 5th row | Yangon |
Common Values
| Value | Count | Frequency (%) |
| Yangon | 342 | |
| Mandalay | 333 | |
| Naypyitaw | 328 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| yangon | 342 | |
| mandalay | 333 | |
| naypyitaw | 328 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1997 | |
| n | 1017 | |
| y | 989 | |
| Y | 342 | 4.5% |
| g | 342 | 4.5% |
| o | 342 | 4.5% |
| M | 333 | 4.3% |
| d | 333 | 4.3% |
| l | 333 | 4.3% |
| N | 328 | 4.3% |
| Other values (4) | 1312 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7668 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1997 | |
| n | 1017 | |
| y | 989 | |
| Y | 342 | 4.5% |
| g | 342 | 4.5% |
| o | 342 | 4.5% |
| M | 333 | 4.3% |
| d | 333 | 4.3% |
| l | 333 | 4.3% |
| N | 328 | 4.3% |
| Other values (4) | 1312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7668 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1997 | |
| n | 1017 | |
| y | 989 | |
| Y | 342 | 4.5% |
| g | 342 | 4.5% |
| o | 342 | 4.5% |
| M | 333 | 4.3% |
| d | 333 | 4.3% |
| l | 333 | 4.3% |
| N | 328 | 4.3% |
| Other values (4) | 1312 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7668 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1997 | |
| n | 1017 | |
| y | 989 | |
| Y | 342 | 4.5% |
| g | 342 | 4.5% |
| o | 342 | 4.5% |
| M | 333 | 4.3% |
| d | 333 | 4.3% |
| l | 333 | 4.3% |
| N | 328 | 4.3% |
| Other values (4) | 1312 |
Customer type
Categorical
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 79 |
| Missing (%) | 7.9% |
| Memory size | 61.9 KiB |
| Normal | |
|---|---|
| Member |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Member |
|---|---|
| 2nd row | Normal |
| 3rd row | Normal |
| 4th row | Member |
| 5th row | Normal |
Common Values
| Value | Count | Frequency (%) |
| Normal | 470 | |
| Member | 454 | |
| (Missing) | 79 | 7.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| normal | 470 | |
| member | 454 |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 924 | |
| m | 924 | |
| e | 908 | |
| N | 470 | |
| o | 470 | |
| a | 470 | |
| l | 470 | |
| M | 454 | |
| b | 454 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5544 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| r | 924 | |
| m | 924 | |
| e | 908 | |
| N | 470 | |
| o | 470 | |
| a | 470 | |
| l | 470 | |
| M | 454 | |
| b | 454 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5544 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| r | 924 | |
| m | 924 | |
| e | 908 | |
| N | 470 | |
| o | 470 | |
| a | 470 | |
| l | 470 | |
| M | 454 | |
| b | 454 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5544 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| r | 924 | |
| m | 924 | |
| e | 908 | |
| N | 470 | |
| o | 470 | |
| a | 470 | |
| l | 470 | |
| M | 454 | |
| b | 454 |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.000997 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Male |
| 4th row | Male |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 502 | |
| Male | 501 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 502 | |
| male | 501 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1505 | |
| a | 1003 | |
| l | 1003 | |
| F | 502 | 10.0% |
| m | 502 | 10.0% |
| M | 501 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5016 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1505 | |
| a | 1003 | |
| l | 1003 | |
| F | 502 | 10.0% |
| m | 502 | 10.0% |
| M | 501 | 10.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5016 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1505 | |
| a | 1003 | |
| l | 1003 | |
| F | 502 | 10.0% |
| m | 502 | 10.0% |
| M | 501 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5016 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1505 | |
| a | 1003 | |
| l | 1003 | |
| F | 502 | 10.0% |
| m | 502 | 10.0% |
| M | 501 | 10.0% |
Product line
Categorical
Missing
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 43 |
| Missing (%) | 4.3% |
| Memory size | 73.6 KiB |
| Fashion accessories | |
|---|---|
| Electronic accessories | |
| Food and beverages | |
| Sports and travel | |
| Home and lifestyle |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 18.546875 |
| Min length | 17 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Health and beauty |
|---|---|
| 2nd row | Electronic accessories |
| 3rd row | Home and lifestyle |
| 4th row | Health and beauty |
| 5th row | Sports and travel |
Common Values
| Value | Count | Frequency (%) |
| Fashion accessories | 172 | |
| Electronic accessories | 165 | |
| Food and beverages | 165 | |
| Sports and travel | 163 | |
| Home and lifestyle | 151 | |
| Health and beauty | 144 | |
| (Missing) | 43 | 4.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| and | 623 | |
| accessories | 337 | |
| fashion | 172 | 6.8% |
| electronic | 165 | 6.5% |
| food | 165 | 6.5% |
| beverages | 165 | 6.5% |
| sports | 163 | 6.4% |
| travel | 163 | 6.4% |
| home | 151 | 5.9% |
| lifestyle | 151 | 5.9% |
| Other values (2) | 288 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2238 | |
| a | 1748 | 9.8% |
| s | 1662 | 9.3% |
| 1583 | 8.9% | |
| o | 1318 | 7.4% |
| c | 1004 | 5.6% |
| r | 993 | 5.6% |
| n | 960 | 5.4% |
| t | 930 | 5.2% |
| i | 825 | 4.6% |
| Other values (15) | 4544 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 17805 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 2238 | |
| a | 1748 | 9.8% |
| s | 1662 | 9.3% |
| 1583 | 8.9% | |
| o | 1318 | 7.4% |
| c | 1004 | 5.6% |
| r | 993 | 5.6% |
| n | 960 | 5.4% |
| t | 930 | 5.2% |
| i | 825 | 4.6% |
| Other values (15) | 4544 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 17805 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 2238 | |
| a | 1748 | 9.8% |
| s | 1662 | 9.3% |
| 1583 | 8.9% | |
| o | 1318 | 7.4% |
| c | 1004 | 5.6% |
| r | 993 | 5.6% |
| n | 960 | 5.4% |
| t | 930 | 5.2% |
| i | 825 | 4.6% |
| Other values (15) | 4544 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 17805 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 2238 | |
| a | 1748 | 9.8% |
| s | 1662 | 9.3% |
| 1583 | 8.9% | |
| o | 1318 | 7.4% |
| c | 1004 | 5.6% |
| r | 993 | 5.6% |
| n | 960 | 5.4% |
| t | 930 | 5.2% |
| i | 825 | 4.6% |
| Other values (15) | 4544 |
Unit price
Real number (ℝ)
High correlation
| Distinct | 938 |
|---|---|
| Distinct (%) | 94.2% |
| Missing | 7 |
| Missing (%) | 0.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 55.764568 |
| Minimum | 10.08 |
|---|---|
| Maximum | 99.96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 10.08 |
|---|---|
| 5-th percentile | 15.275 |
| Q1 | 33.125 |
| median | 55.42 |
| Q3 | 78.085 |
| 95-th percentile | 97.2125 |
| Maximum | 99.96 |
| Range | 89.88 |
| Interquartile range (IQR) | 44.96 |
Descriptive statistics
| Standard deviation | 26.510165 |
|---|---|
| Coefficient of variation (CV) | 0.47539443 |
| Kurtosis | -1.2226701 |
| Mean | 55.764568 |
| Median Absolute Deviation (MAD) | 22.575 |
| Skewness | 0.00017534848 |
| Sum | 55541.51 |
| Variance | 702.78887 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 83.77 | 3 | 0.3% |
| 88.34 | 2 | 0.2% |
| 60.3 | 2 | 0.2% |
| 32.32 | 2 | 0.2% |
| 32.25 | 2 | 0.2% |
| 99.96 | 2 | 0.2% |
| 45.58 | 2 | 0.2% |
| 39.75 | 2 | 0.2% |
| 68.71 | 2 | 0.2% |
| 45.38 | 2 | 0.2% |
| Other values (928) | 975 | |
| (Missing) | 7 | 0.7% |
| Value | Count | Frequency (%) |
| 10.08 | 1 | |
| 10.13 | 1 | |
| 10.16 | 1 | |
| 10.17 | 1 | |
| 10.18 | 1 | |
| 10.53 | 1 | |
| 10.56 | 1 | |
| 10.59 | 1 | |
| 10.69 | 1 | |
| 10.75 | 1 |
| Value | Count | Frequency (%) |
| 99.96 | 2 | |
| 99.92 | 1 | |
| 99.89 | 1 | |
| 99.83 | 1 | |
| 99.82 | 2 | |
| 99.79 | 1 | |
| 99.78 | 1 | |
| 99.73 | 1 | |
| 99.71 | 1 | |
| 99.7 | 1 |
Quantity
Real number (ℝ)
High correlation Missing
| Distinct | 10 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 20 |
| Missing (%) | 2.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.5015259 |
| Minimum | 1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.9246734 |
|---|---|
| Coefficient of variation (CV) | 0.53161131 |
| Kurtosis | -1.216926 |
| Mean | 5.5015259 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.016679454 |
| Sum | 5408 |
| Variance | 8.5537146 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10 | 116 | |
| 1 | 111 | |
| 4 | 108 | |
| 7 | 100 | |
| 5 | 100 | |
| 6 | 96 | |
| 9 | 92 | |
| 2 | 89 | |
| 3 | 89 | |
| 8 | 82 | |
| (Missing) | 20 | 2.0% |
| Value | Count | Frequency (%) |
| 1 | 111 | |
| 2 | 89 | |
| 3 | 89 | |
| 4 | 108 | |
| 5 | 100 | |
| 6 | 96 | |
| 7 | 100 | |
| 8 | 82 | |
| 9 | 92 | |
| 10 | 116 |
| Value | Count | Frequency (%) |
| 10 | 116 | |
| 9 | 92 | |
| 8 | 82 | |
| 7 | 100 | |
| 6 | 96 | |
| 5 | 100 | |
| 4 | 108 | |
| 3 | 89 | |
| 2 | 89 | |
| 1 | 111 |
Tax 5%
Real number (ℝ)
High correlation
| Distinct | 990 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.400368 |
| Minimum | 0.5085 |
|---|---|
| Maximum | 49.65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 0.5085 |
|---|---|
| 5-th percentile | 1.9575 |
| Q1 | 5.89475 |
| median | 12.096 |
| Q3 | 22.5395 |
| 95-th percentile | 39.146 |
| Maximum | 49.65 |
| Range | 49.1415 |
| Interquartile range (IQR) | 16.64475 |
Descriptive statistics
| Standard deviation | 11.715192 |
|---|---|
| Coefficient of variation (CV) | 0.76070857 |
| Kurtosis | -0.097090352 |
| Mean | 15.400368 |
| Median Absolute Deviation (MAD) | 7.518 |
| Skewness | 0.88698241 |
| Sum | 15446.569 |
| Variance | 137.24572 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30.919 | 2 | 0.2% |
| 9.0045 | 2 | 0.2% |
| 4.464 | 2 | 0.2% |
| 10.3635 | 2 | 0.2% |
| 13.188 | 2 | 0.2% |
| 8.377 | 2 | 0.2% |
| 39.48 | 2 | 0.2% |
| 5.803 | 2 | 0.2% |
| 10.326 | 2 | 0.2% |
| 12.57 | 2 | 0.2% |
| Other values (980) | 983 |
| Value | Count | Frequency (%) |
| 0.5085 | 1 | |
| 0.6045 | 1 | |
| 0.627 | 1 | |
| 0.639 | 1 | |
| 0.699 | 1 | |
| 0.767 | 1 | |
| 0.7715 | 1 | |
| 0.775 | 1 | |
| 0.814 | 1 | |
| 0.8875 | 1 |
| Value | Count | Frequency (%) |
| 49.65 | 1 | |
| 49.49 | 1 | |
| 49.26 | 1 | |
| 48.75 | 1 | |
| 48.69 | 1 | |
| 48.685 | 1 | |
| 48.605 | 1 | |
| 47.79 | 1 | |
| 47.72 | 1 | |
| 45.325 | 1 |
Total
Real number (ℝ)
High correlation
| Distinct | 990 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 323.40773 |
| Minimum | 10.6785 |
|---|---|
| Maximum | 1042.65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 10.6785 |
|---|---|
| 5-th percentile | 41.1075 |
| Q1 | 123.78975 |
| median | 254.016 |
| Q3 | 473.3295 |
| 95-th percentile | 822.066 |
| Maximum | 1042.65 |
| Range | 1031.9715 |
| Interquartile range (IQR) | 349.53975 |
Descriptive statistics
| Standard deviation | 246.01903 |
|---|---|
| Coefficient of variation (CV) | 0.76070857 |
| Kurtosis | -0.097090352 |
| Mean | 323.40773 |
| Median Absolute Deviation (MAD) | 157.878 |
| Skewness | 0.88698241 |
| Sum | 324377.95 |
| Variance | 60525.362 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 649.299 | 2 | 0.2% |
| 189.0945 | 2 | 0.2% |
| 93.744 | 2 | 0.2% |
| 217.6335 | 2 | 0.2% |
| 276.948 | 2 | 0.2% |
| 175.917 | 2 | 0.2% |
| 829.08 | 2 | 0.2% |
| 121.863 | 2 | 0.2% |
| 216.846 | 2 | 0.2% |
| 263.97 | 2 | 0.2% |
| Other values (980) | 983 |
| Value | Count | Frequency (%) |
| 10.6785 | 1 | |
| 12.6945 | 1 | |
| 13.167 | 1 | |
| 13.419 | 1 | |
| 14.679 | 1 | |
| 16.107 | 1 | |
| 16.2015 | 1 | |
| 16.275 | 1 | |
| 17.094 | 1 | |
| 18.6375 | 1 |
| Value | Count | Frequency (%) |
| 1042.65 | 1 | |
| 1039.29 | 1 | |
| 1034.46 | 1 | |
| 1023.75 | 1 | |
| 1022.49 | 1 | |
| 1022.385 | 1 | |
| 1020.705 | 1 | |
| 1003.59 | 1 | |
| 1002.12 | 1 | |
| 951.825 | 1 |
Date
Date
| Distinct | 89 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.0 KiB |
| Minimum | 2019-01-01 00:00:00 |
|---|---|
| Maximum | 2019-03-30 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Time
Date
| Distinct | 506 |
|---|---|
| Distinct (%) | 50.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.0 KiB |
| Minimum | 2026-01-19 10:00:00 |
|---|---|
| Maximum | 2026-01-19 20:59:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Payment
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 63.0 KiB |
| Ewallet | |
|---|---|
| Cash | |
| Credit card |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.2053838 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Ewallet |
|---|---|
| 2nd row | Cash |
| 3rd row | Credit card |
| 4th row | Ewallet |
| 5th row | Ewallet |
Common Values
| Value | Count | Frequency (%) |
| Ewallet | 346 | |
| Cash | 346 | |
| Credit card | 311 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ewallet | 346 | |
| cash | 346 | |
| credit | 311 | |
| card | 311 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1003 | |
| l | 692 | |
| e | 657 | |
| t | 657 | |
| C | 657 | |
| r | 622 | |
| d | 622 | |
| E | 346 | 4.8% |
| w | 346 | 4.8% |
| s | 346 | 4.8% |
| Other values (4) | 1279 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 7227 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1003 | |
| l | 692 | |
| e | 657 | |
| t | 657 | |
| C | 657 | |
| r | 622 | |
| d | 622 | |
| E | 346 | 4.8% |
| w | 346 | 4.8% |
| s | 346 | 4.8% |
| Other values (4) | 1279 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 7227 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1003 | |
| l | 692 | |
| e | 657 | |
| t | 657 | |
| C | 657 | |
| r | 622 | |
| d | 622 | |
| E | 346 | 4.8% |
| w | 346 | 4.8% |
| s | 346 | 4.8% |
| Other values (4) | 1279 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 7227 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1003 | |
| l | 692 | |
| e | 657 | |
| t | 657 | |
| C | 657 | |
| r | 622 | |
| d | 622 | |
| E | 346 | 4.8% |
| w | 346 | 4.8% |
| s | 346 | 4.8% |
| Other values (4) | 1279 |
cogs
Real number (ℝ)
High correlation
| Distinct | 990 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 308.00736 |
| Minimum | 10.17 |
|---|---|
| Maximum | 993 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 10.17 |
|---|---|
| 5-th percentile | 39.15 |
| Q1 | 117.895 |
| median | 241.92 |
| Q3 | 450.79 |
| 95-th percentile | 782.92 |
| Maximum | 993 |
| Range | 982.83 |
| Interquartile range (IQR) | 332.895 |
Descriptive statistics
| Standard deviation | 234.30384 |
|---|---|
| Coefficient of variation (CV) | 0.76070857 |
| Kurtosis | -0.097090352 |
| Mean | 308.00736 |
| Median Absolute Deviation (MAD) | 150.36 |
| Skewness | 0.88698241 |
| Sum | 308931.38 |
| Variance | 54898.288 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 618.38 | 2 | 0.2% |
| 180.09 | 2 | 0.2% |
| 89.28 | 2 | 0.2% |
| 207.27 | 2 | 0.2% |
| 263.76 | 2 | 0.2% |
| 167.54 | 2 | 0.2% |
| 789.6 | 2 | 0.2% |
| 116.06 | 2 | 0.2% |
| 206.52 | 2 | 0.2% |
| 251.4 | 2 | 0.2% |
| Other values (980) | 983 |
| Value | Count | Frequency (%) |
| 10.17 | 1 | |
| 12.09 | 1 | |
| 12.54 | 1 | |
| 12.78 | 1 | |
| 13.98 | 1 | |
| 15.34 | 1 | |
| 15.43 | 1 | |
| 15.5 | 1 | |
| 16.28 | 1 | |
| 17.75 | 1 |
| Value | Count | Frequency (%) |
| 993 | 1 | |
| 989.8 | 1 | |
| 985.2 | 1 | |
| 975 | 1 | |
| 973.8 | 1 | |
| 973.7 | 1 | |
| 972.1 | 1 | |
| 955.8 | 1 | |
| 954.4 | 1 | |
| 906.5 | 1 |
gross margin percentage
Categorical
Constant
| Distinct | 1 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 66.7 KiB |
| 4.761904762 |
|---|
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4.761904762 |
|---|---|
| 2nd row | 4.761904762 |
| 3rd row | 4.761904762 |
| 4th row | 4.761904762 |
| 5th row | 4.761904762 |
Common Values
| Value | Count | Frequency (%) |
| 4.761904762 | 1003 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 4.761904762 | 1003 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 2006 | |
| 7 | 2006 | |
| 6 | 2006 | |
| . | 1003 | |
| 1 | 1003 | |
| 9 | 1003 | |
| 0 | 1003 | |
| 2 | 1003 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 11033 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 4 | 2006 | |
| 7 | 2006 | |
| 6 | 2006 | |
| . | 1003 | |
| 1 | 1003 | |
| 9 | 1003 | |
| 0 | 1003 | |
| 2 | 1003 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 11033 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 4 | 2006 | |
| 7 | 2006 | |
| 6 | 2006 | |
| . | 1003 | |
| 1 | 1003 | |
| 9 | 1003 | |
| 0 | 1003 | |
| 2 | 1003 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 11033 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 4 | 2006 | |
| 7 | 2006 | |
| 6 | 2006 | |
| . | 1003 | |
| 1 | 1003 | |
| 9 | 1003 | |
| 0 | 1003 | |
| 2 | 1003 |
gross income
Real number (ℝ)
High correlation
| Distinct | 990 |
|---|---|
| Distinct (%) | 98.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.400368 |
| Minimum | 0.5085 |
|---|---|
| Maximum | 49.65 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 0.5085 |
|---|---|
| 5-th percentile | 1.9575 |
| Q1 | 5.89475 |
| median | 12.096 |
| Q3 | 22.5395 |
| 95-th percentile | 39.146 |
| Maximum | 49.65 |
| Range | 49.1415 |
| Interquartile range (IQR) | 16.64475 |
Descriptive statistics
| Standard deviation | 11.715192 |
|---|---|
| Coefficient of variation (CV) | 0.76070857 |
| Kurtosis | -0.097090352 |
| Mean | 15.400368 |
| Median Absolute Deviation (MAD) | 7.518 |
| Skewness | 0.88698241 |
| Sum | 15446.569 |
| Variance | 137.24572 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30.919 | 2 | 0.2% |
| 9.0045 | 2 | 0.2% |
| 4.464 | 2 | 0.2% |
| 10.3635 | 2 | 0.2% |
| 13.188 | 2 | 0.2% |
| 8.377 | 2 | 0.2% |
| 39.48 | 2 | 0.2% |
| 5.803 | 2 | 0.2% |
| 10.326 | 2 | 0.2% |
| 12.57 | 2 | 0.2% |
| Other values (980) | 983 |
| Value | Count | Frequency (%) |
| 0.5085 | 1 | |
| 0.6045 | 1 | |
| 0.627 | 1 | |
| 0.639 | 1 | |
| 0.699 | 1 | |
| 0.767 | 1 | |
| 0.7715 | 1 | |
| 0.775 | 1 | |
| 0.814 | 1 | |
| 0.8875 | 1 |
| Value | Count | Frequency (%) |
| 49.65 | 1 | |
| 49.49 | 1 | |
| 49.26 | 1 | |
| 48.75 | 1 | |
| 48.69 | 1 | |
| 48.685 | 1 | |
| 48.605 | 1 | |
| 47.79 | 1 | |
| 47.72 | 1 | |
| 45.325 | 1 |
Rating
Real number (ℝ)
| Distinct | 61 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.972682 |
| Minimum | 4 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.0 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4.3 |
| Q1 | 5.5 |
| median | 7 |
| Q3 | 8.5 |
| 95-th percentile | 9.7 |
| Maximum | 10 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.7176469 |
|---|---|
| Coefficient of variation (CV) | 0.24633949 |
| Kurtosis | -1.1512945 |
| Mean | 6.972682 |
| Median Absolute Deviation (MAD) | 1.5 |
| Skewness | 0.009592349 |
| Sum | 6993.6 |
| Variance | 2.9503109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 26 | 2.6% |
| 6.6 | 25 | 2.5% |
| 4.2 | 22 | 2.2% |
| 9.5 | 22 | 2.2% |
| 6.5 | 21 | 2.1% |
| 5 | 21 | 2.1% |
| 6.2 | 21 | 2.1% |
| 8 | 21 | 2.1% |
| 5.1 | 21 | 2.1% |
| 7.6 | 20 | 2.0% |
| Other values (51) | 783 |
| Value | Count | Frequency (%) |
| 4 | 11 | |
| 4.1 | 17 | |
| 4.2 | 22 | |
| 4.3 | 18 | |
| 4.4 | 17 | |
| 4.5 | 17 | |
| 4.6 | 8 | 0.8% |
| 4.7 | 12 | |
| 4.8 | 13 | |
| 4.9 | 18 |
| Value | Count | Frequency (%) |
| 10 | 5 | 0.5% |
| 9.9 | 16 | |
| 9.8 | 19 | |
| 9.7 | 14 | |
| 9.6 | 17 | |
| 9.5 | 22 | |
| 9.4 | 12 | |
| 9.3 | 16 | |
| 9.2 | 16 | |
| 9.1 | 14 |
Interactions
Correlations
| Branch | City | Customer type | Gender | Payment | Product line | Quantity | Rating | Tax 5% | Total | Unit price | cogs | gross income | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Branch | 1.000 | 1.000 | 0.000 | 0.040 | 0.000 | 0.012 | 0.019 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| City | 1.000 | 1.000 | 0.000 | 0.040 | 0.000 | 0.012 | 0.019 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Customer type | 0.000 | 0.000 | 1.000 | 0.000 | 0.057 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Gender | 0.040 | 0.040 | 0.000 | 1.000 | 0.033 | 0.030 | 0.048 | 0.058 | 0.000 | 0.000 | 0.055 | 0.000 | 0.000 |
| Payment | 0.000 | 0.000 | 0.057 | 0.033 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.031 | 0.000 | 0.000 |
| Product line | 0.012 | 0.012 | 0.000 | 0.030 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| Quantity | 0.019 | 0.019 | 0.000 | 0.048 | 0.000 | 0.000 | 1.000 | -0.022 | 0.739 | 0.739 | 0.016 | 0.739 | 0.739 |
| Rating | 0.000 | 0.000 | 0.000 | 0.058 | 0.000 | 0.000 | -0.022 | 1.000 | -0.020 | -0.020 | -0.008 | -0.020 | -0.020 |
| Tax 5% | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.739 | -0.020 | 1.000 | 1.000 | 0.631 | 1.000 | 1.000 |
| Total | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.739 | -0.020 | 1.000 | 1.000 | 0.631 | 1.000 | 1.000 |
| Unit price | 0.000 | 0.000 | 0.000 | 0.055 | 0.031 | 0.000 | 0.016 | -0.008 | 0.631 | 0.631 | 1.000 | 0.631 | 0.631 |
| cogs | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.739 | -0.020 | 1.000 | 1.000 | 0.631 | 1.000 | 1.000 |
| gross income | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.739 | -0.020 | 1.000 | 1.000 | 0.631 | 1.000 | 1.000 |
Missing values
Sample
| Invoice ID | Branch | City | Customer type | Gender | Product line | Unit price | Quantity | Tax 5% | Total | Date | Time | Payment | cogs | gross margin percentage | gross income | Rating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 750-67-8428 | A | Yangon | Member | Female | Health and beauty | 74.69 | 7.0 | 26.1415 | 548.9715 | 1/5/19 | 13:08 | Ewallet | 522.83 | 4.761905 | 26.1415 | 9.1 |
| 1 | 226-31-3081 | C | Naypyitaw | Normal | Female | Electronic accessories | 15.28 | 5.0 | 3.8200 | 80.2200 | 3/8/19 | 10:29 | Cash | 76.40 | 4.761905 | 3.8200 | 9.6 |
| 2 | 631-41-3108 | A | Yangon | Normal | Male | Home and lifestyle | 46.33 | 7.0 | 16.2155 | 340.5255 | 3/3/19 | 13:23 | Credit card | 324.31 | 4.761905 | 16.2155 | 7.4 |
| 3 | 123-19-1176 | A | Yangon | Member | Male | Health and beauty | 58.22 | 8.0 | 23.2880 | 489.0480 | 1/27/19 | 20:33 | Ewallet | 465.76 | 4.761905 | 23.2880 | 8.4 |
| 4 | 373-73-7910 | A | Yangon | Normal | Male | Sports and travel | 86.31 | 7.0 | 30.2085 | 634.3785 | 2/8/19 | 10:37 | Ewallet | 604.17 | 4.761905 | 30.2085 | 5.3 |
| 5 | 699-14-3026 | C | Naypyitaw | Normal | Male | Electronic accessories | 85.39 | 7.0 | 29.8865 | 627.6165 | 3/25/19 | 18:30 | Ewallet | 597.73 | 4.761905 | 29.8865 | 4.1 |
| 6 | 355-53-5943 | A | Yangon | Member | Female | NaN | 68.84 | 6.0 | 20.6520 | 433.6920 | 2/25/19 | 14:36 | Ewallet | 413.04 | 4.761905 | 20.6520 | 5.8 |
| 7 | 315-22-5665 | C | Naypyitaw | Normal | Female | NaN | 73.56 | 10.0 | 36.7800 | 772.3800 | 2/24/19 | 11:38 | Ewallet | 735.60 | 4.761905 | 36.7800 | 8.0 |
| 8 | 665-32-9167 | A | Yangon | Member | Female | NaN | 36.26 | 2.0 | 3.6260 | 76.1460 | 1/10/19 | 17:15 | Credit card | 72.52 | 4.761905 | 3.6260 | 7.2 |
| 9 | 692-92-5582 | B | Mandalay | Member | Female | NaN | 54.84 | 3.0 | 8.2260 | 172.7460 | 2/20/19 | 13:27 | Credit card | 164.52 | 4.761905 | 8.2260 | 5.9 |
| Invoice ID | Branch | City | Customer type | Gender | Product line | Unit price | Quantity | Tax 5% | Total | Date | Time | Payment | cogs | gross margin percentage | gross income | Rating | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 993 | 690-01-6631 | B | Mandalay | Normal | Male | Fashion accessories | NaN | 10.0 | 8.7450 | 183.6450 | 2/22/19 | 18:35 | Ewallet | 174.90 | 4.761905 | 8.7450 | 6.6 |
| 994 | 652-49-6720 | C | Naypyitaw | Member | Female | Electronic accessories | NaN | 1.0 | 3.0475 | 63.9975 | 2/18/19 | 11:40 | Ewallet | 60.95 | 4.761905 | 3.0475 | 5.9 |
| 995 | 233-67-5758 | C | Naypyitaw | Normal | Male | Health and beauty | NaN | 1.0 | 2.0175 | 42.3675 | 1/29/19 | 13:46 | Ewallet | 40.35 | 4.761905 | 2.0175 | 6.2 |
| 996 | 303-96-2227 | B | Mandalay | Normal | Female | Home and lifestyle | NaN | 10.0 | 48.6900 | 1022.4900 | 3/2/19 | 17:16 | Ewallet | 973.80 | 4.761905 | 48.6900 | 4.4 |
| 997 | 727-02-1313 | A | Yangon | Member | Male | Food and beverages | NaN | 1.0 | 1.5920 | 33.4320 | 2/9/19 | 13:22 | Cash | 31.84 | 4.761905 | 1.5920 | 7.7 |
| 998 | 347-56-2442 | A | Yangon | Normal | Male | Home and lifestyle | 65.82 | 1.0 | 3.2910 | 69.1110 | 2/22/19 | 15:33 | Cash | 65.82 | 4.761905 | 3.2910 | 4.1 |
| 999 | 849-09-3807 | A | Yangon | Member | Female | Fashion accessories | 88.34 | 7.0 | 30.9190 | 649.2990 | 2/18/19 | 13:28 | Cash | 618.38 | 4.761905 | 30.9190 | 6.6 |
| 1000 | 849-09-3807 | A | Yangon | Member | Female | Fashion accessories | 88.34 | 7.0 | 30.9190 | 649.2990 | 2/18/19 | 13:28 | Cash | 618.38 | 4.761905 | 30.9190 | 6.6 |
| 1001 | 745-74-0715 | A | Yangon | Normal | Male | Electronic accessories | NaN | 2.0 | 5.8030 | 121.8630 | 3/10/19 | 20:46 | Ewallet | 116.06 | 4.761905 | 5.8030 | 8.8 |
| 1002 | 452-04-8808 | B | Mandalay | Normal | Male | Electronic accessories | 87.08 | NaN | 30.4780 | 640.0380 | 1/26/19 | 15:17 | Cash | 609.56 | 4.761905 | 30.4780 | 5.5 |
Duplicate rows
Most frequently occurring
| Invoice ID | Branch | City | Customer type | Gender | Product line | Unit price | Quantity | Tax 5% | Total | Date | Time | Payment | cogs | gross margin percentage | gross income | Rating | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 452-04-8808 | B | Mandalay | Normal | Male | Electronic accessories | 87.08 | NaN | 30.478 | 640.038 | 1/26/19 | 15:17 | Cash | 609.56 | 4.761905 | 30.478 | 5.5 | 2 |
| 1 | 745-74-0715 | A | Yangon | Normal | Male | Electronic accessories | NaN | 2.0 | 5.803 | 121.863 | 3/10/19 | 20:46 | Ewallet | 116.06 | 4.761905 | 5.803 | 8.8 | 2 |
| 2 | 849-09-3807 | A | Yangon | Member | Female | Fashion accessories | 88.34 | 7.0 | 30.919 | 649.299 | 2/18/19 | 13:28 | Cash | 618.38 | 4.761905 | 30.919 | 6.6 | 2 |